Search CORE

139 research outputs found

UniMSE: Towards Unified Multimodal Sentiment Analysis and Emotion Recognition

Author: Hu Guimin
Li Yongbin
Lin Ting-En
Lu Guangming
Wu Yuchuan
Zhao Yi
Publication venue
Publication date: 21/11/2022
Field of study

Multimodal sentiment analysis (MSA) and emotion recognition in conversation (ERC) are key research topics for computers to understand human behaviors. From a psychological perspective, emotions are the expression of affect or feelings during a short period, while sentiments are formed and held for a longer period. However, most existing works study sentiment and emotion separately and do not fully exploit the complementary knowledge behind the two. In this paper, we propose a multimodal sentiment knowledge-sharing framework (UniMSE) that unifies MSA and ERC tasks from features, labels, and models. We perform modality fusion at the syntactic and semantic levels and introduce contrastive learning between modalities and samples to better capture the difference and consistency between sentiments and emotions. Experiments on four public benchmark datasets, MOSI, MOSEI, MELD, and IEMOCAP, demonstrate the effectiveness of the proposed method and achieve consistent improvements compared with state-of-the-art methods.Comment: Accepted to EMNLP 2022 main conferenc

arXiv.org e-Print Archive

UniPCM: Universal Pre-trained Conversation Model with Task-aware Automatic Prompt

Author: Cai Yucheng
Li Yongbin
Ma Wentao
Ou Zhijian
Shao Yuan
Si Shuzheng
Wu Yuchuan
Publication venue
Publication date: 20/09/2023
Field of study

Recent research has shown that multi-task pre-training greatly improves the model's robustness and transfer ability, which is crucial for building a high-quality dialog system. However, most previous works on multi-task pre-training rely heavily on human-defined input format or prompt, which is not optimal in quality and quantity. In this work, we propose to use Task-based Automatic Prompt generation (TAP) to automatically generate high-quality prompts. Using the high-quality prompts generated, we scale the corpus of the pre-trained conversation model to 122 datasets from 15 dialog-related tasks, resulting in Universal Pre-trained Conversation Model (UniPCM), a powerful foundation model for various conversational tasks and different dialog systems. Extensive experiments have shown that UniPCM is robust to input prompts and capable of various dialog-related tasks. Moreover, UniPCM has strong transfer ability and excels at low resource scenarios, achieving SOTA results on 9 different datasets ranging from task-oriented dialog to open-domain conversation. Furthermore, we are amazed to find that TAP can generate prompts on par with those collected with crowdsourcing. The code is released with the paper

arXiv.org e-Print Archive

UniSA: Unified Generative Framework for Sentiment Analysis

Author: Li Yongbin
Li Zaijing
Lin Ting-En
Liu Meng
Tang Fengxiao
Wu Yuchuan
Zhao Ming
Publication venue
Publication date: 03/09/2023
Field of study

Sentiment analysis is a crucial task that aims to understand people's emotional states and predict emotional categories based on multimodal information. It consists of several subtasks, such as emotion recognition in conversation (ERC), aspect-based sentiment analysis (ABSA), and multimodal sentiment analysis (MSA). However, unifying all subtasks in sentiment analysis presents numerous challenges, including modality alignment, unified input/output forms, and dataset bias. To address these challenges, we propose a Task-Specific Prompt method to jointly model subtasks and introduce a multimodal generative framework called UniSA. Additionally, we organize the benchmark datasets of main subtasks into a new Sentiment Analysis Evaluation benchmark, SAEval. We design novel pre-training tasks and training methods to enable the model to learn generic sentiment knowledge among subtasks to improve the model's multimodal sentiment perception ability. Our experimental results show that UniSA performs comparably to the state-of-the-art on all subtasks and generalizes well to various subtasks in sentiment analysis.Comment: Accepted to ACM MM 202

arXiv.org e-Print Archive

Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models

Author: Gao Haoyu
Li Hangyu
Li Yongbin
Lin Ting-En
Ma Wentao
Wu Yuchuan
Yang Min
Publication venue
Publication date: 22/09/2023
Field of study

Task-oriented dialogue (TOD) systems facilitate users in executing various activities via multi-turn dialogues, but Large Language Models (LLMs) often struggle to comprehend these intricate contexts. In this study, we propose a novel "Self-Explanation" prompting strategy to enhance the comprehension abilities of LLMs in multi-turn dialogues. This task-agnostic approach requires the model to analyze each dialogue utterance before task execution, thereby improving performance across various dialogue-centric tasks. Experimental results from six benchmark datasets confirm that our method consistently outperforms other zero-shot prompts and matches or exceeds the efficacy of few-shot prompts, demonstrating its potential as a powerful tool in enhancing LLMs' comprehension in complex dialogue tasks

arXiv.org e-Print Archive

Detection of Flare-induced Plasma Flows in the Corona of EV Lac with X-ray Spectroscopy

Author: Chen Hechao
Hou Zhenyong
Li Hao
Lu Hongpeng
Tian Hui
Wang Jianguo
Wu Yuchuan
Xu Yu
Publication venue: 'American Astronomical Society'
Publication date: 27/05/2022
Field of study

Stellar flares are characterized by sudden enhancement of electromagnetic radiation from the atmospheres of stars. Compared to their solar counterparts, our knowledge on the coronal plasma dynamics of stellar flares and their connection to coronal mass ejections (CMEs) remains very limited. With time-resolved high-resolution spectroscopic observations from the \textit{Chandra} X-ray observatory, we detected noticeable coronal plasma flows during several stellar flares on a nearby dMe star EV Lac. In the observed spectra of O~{\sc{viii}} (3 MK), Fe~{\sc{xvii}} (6 MK), Mg~{\sc{xii}} (10 MK), and Si~{\sc{xiv}} (16 MK) lines, these flare-induced upflows/downflows appear as significant Doppler shifts of several tens to \speed{130}, and the upflow velocity generally increases with temperature. Variable line ratios of the Si~{\sc{xiii}} triplet reveal that these plasma flows in most flares are accompanied by an increase of the coronal plasma density and temperature. We interpret these results as X-ray evidences for chromospheric evaporation on EV Lac. In two successive flares, the plasma flow pattern and a sharp increase of the measured coronal density are highly suggestive of explosive evaporation. The transition from redshifts to blueshifts in such an explosive evaporation occurs at a temperature of at least 10 MK, much higher than that observed in solar flares (

\sim

1 MK). However, in one flare the cool and warm upflows appear to be accompanied by a decreasing plasma density, which might be explained by a stellar filament/prominence eruption coupled to this flare. These results provide important clues to understand the coronal plasma dynamics during flares on M dwarfs.Comment: accepted by Ap

arXiv.org e-Print Archive

CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog Evaluation

Author: An Zhongqi
Cao Zheng
Dai Yinpei
He Wanwei
Li Bowen
Li Yongbin
Sun Jian
Wu Yuchuan
Publication venue
Publication date: 21/11/2022
Field of study

Practical dialog systems need to deal with various knowledge sources, noisy user expressions, and the shortage of annotated data. To better solve the above problems, we propose CGoDial, new challenging and comprehensive Chinese benchmark for multi-domain Goal-oriented Dialog evaluation. It contains 96,763 dialog sessions and 574,949 dialog turns totally, covering three datasets with different knowledge sources: 1) a slot-based dialog (SBD) dataset with table-formed knowledge, 2) a flow-based dialog (FBD) dataset with tree-formed knowledge, and a retrieval-based dialog (RBD) dataset with candidate-formed knowledge. To bridge the gap between academic benchmarks and spoken dialog scenarios, we either collect data from real conversations or add spoken features to existing datasets via crowd-sourcing. The proposed experimental settings include the combinations of training with either the entire training set or a few-shot training set, and testing with either the standard test set or a hard test subset, which can assess model capabilities in terms of general prediction, fast adaptability and reliable robustness.Comment: EMNLP 202

arXiv.org e-Print Archive

Hypothermia in Stroke Therapy: Systemic versus Local Application

Author: Chandra Ankush
Ding Yuchuan
Duan Hong Lian
Geng Xiaokun
Guan Longfei
Huber Mitchell
Li Fengwu
Wu Longfei
Publication venue: 'IntechOpen'
Publication date: 01/02/2017
Field of study

Presently, there are no effective, widely applicable therapies for ischemic stroke. There is strong clinical evidence for the neuroprotective benefits of hypothermia, and surface-cooling methods have been utilized for decades in the treatment of cerebral ischemia during cardiac arrest, but complications with hypothermia induction have hindered its clinical acceptance in ischemic stroke therapy. Recently, the microcatheter-based local endovascular infusion (LEVI) of cold saline directly to the infarct site has been proposed as a solution to the drawbacks of surface cooling. The safety and efficacy of LEVI in rat models have been established, and implementation in larger animals has been similarly encouraging. A recent pilot study even established the safety of LEVI in humans. This review seeks to outline the major research on LEVI, discusses the mechanisms that mediate its superior neuroprotection over surface and systemic cooling, and identifies areas that warrant further investigation. While LEVI features improvements on surface cooling, its core mechanisms of neuroprotection are still largely shared with therapeutic hypothermia in general. As such, the mechanisms of hypothermia-based neuroprotection are discussed as well

IntechOpen

Improving Factual Consistency of Text Summarization by Adversarially Decoupling Comprehension and Embellishment Abilities of LLMs

Author: Fan Yan
Feng Huawen
Huang Fei
Li Yongbin
Lin Ting-En
Liu Xiong
Ma Qianli
Wu Yuchuan
Yao Zekun
Publication venue
Publication date: 14/11/2023
Field of study

Despite the recent progress in text summarization made by large language models (LLMs), they often generate summaries that are factually inconsistent with original articles, known as "hallucinations" in text generation. Unlike previous small models (e.g., BART, T5), current LLMs make fewer silly mistakes but more sophisticated ones, such as imposing cause and effect, adding false details, overgeneralizing, etc. These hallucinations are challenging to detect through traditional methods, which poses great challenges for improving the factual consistency of text summarization. In this paper, we propose an adversarially DEcoupling method to disentangle the Comprehension and EmbellishmeNT abilities of LLMs (DECENT). Furthermore, we adopt a probing-based efficient training to cover the shortage of sensitivity for true and false in the training process of LLMs. In this way, LLMs are less confused about embellishing and understanding; thus, they can execute the instructions more accurately and have enhanced abilities to distinguish hallucinations. Experimental results show that DECENT significantly improves the reliability of text summarization based on LLMs

arXiv.org e-Print Archive

Molecular doping enabled scalable blading of efficient hole-transport-layer-free perovskite solar cells

Author: Deng Yehao
Fang Yanjun
Gruverman Alexei
Huang Jinsong
Li Tao
Liu Ye
Lu Haidong
Shao Yuchuan
Tang Shi
Wang Qi
Wu Wu-Qiang
Yang Zhibin
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2018
Field of study

The efficiencies of perovskite solar cells (PSCs) are now reaching such consistently high levels that scalable manufacturing at low cost is becoming critical. However, this remains challenging due to the expensive hole-transporting materials usually employed, and difficulties associated with the scalable deposition of other functional layers. By simplifying the device architecture, hole-transport-layer-free PSCs with improved photovoltaic performance are fabricated via a scalable doctor-blading process. Molecular doping of halide perovskite films improved the conductivity of the films and their electronic contact with the conductive substrate, resulting in a reduced series resistance. It facilitates the extraction of photoexcited holes from perovskite directly to the conductive substrate. The bladed hole-transport-layerfree PSCs showed a stabilized power conversion efficiency above 20.0%. This work represents a significant step towards the scalable, cost-effective manufacturing of PSCs with both high performance and simple fabrication processes

Crossref

DigitalCommons@University of Nebraska

Directory of Open Access Journals

Carolina Digital Repository